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ABSTRACT 
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methods  for  the  solution  of  problems  in  classical  statistics  and 
probability  is  shown  by  a number  of  examples.  These  concern  testing 
of  hypotheses,  constructing  of  tolerance  regions,  planning  of  optimal 
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SIGNIFICANCE  AND  EXPLANATION 


Mathematical  programming  deals  with  optimization  under  constraints,  p.g. 
minimization  of  costs  in  manufacturing  operations  when  sources  of  supply, 
machines  and  manpower  are  limited.  Stochastic  programming  deals  with 
mathematical  programming  problems  in  which  there  is  uncertainty  associated 
with  the  variables  and  with  the  constraints  in  the  optimization  problem.  Thus 
stochastic  programming  lies  on  the  borderline  between  mathematical  programming 
and  statistics.  However,  the  connection  between  the  development  of  these  two 
sciences  is  not  strong  enough. 

The  purpose  of  this  paper  is  to  show  how  some  stochastic  programming 
methods  (developed  by  the  author)  can  be  applied  in  classical  problems  of 
statistics.  Models  are  formulated  for  a)  construction  of  statistical 
tests,  b)  construction  of  tolerance  regions,  c)  optimum  allocation  in 
surveys,  d)  the  dam  problem  of  MORAN.  The  solution  of  the  above  probltjms 
uses  nonlinear  programming  combined  with  simulation.  Evidence  concerning 
effective  solvability  of  such  problems  is  given  in  other  referenced  papers. 


The  responsibility  for  the  wording  and  views  expressed  in  this  descriptive 
summary  lies  with  MRC,  and  not  with  the  author  of  this  report. 


THE  USE  OF  STOCHASTIC  PROGRAMMING  FOR  THE  SOLUTION  OF  SOME  PROBLEMS 
IN  STATISTICS  AND  PROBABILITY 

Andras  Prekopa 
1.  Introduction 

Stochastic  programming  is  a branch  of  mathematical  programming  because  it 
deals  with  handling  of  mathematical  programming  problems  where  some  of  the 
parameters  are  random  variables.  At  the  same  time  we  may  consider  it  to  be 
part  or  extension  of  statistics  [4]  because  problems  concerning  optimization 
of  stochastic  systems  in  general  can  be  considered  to  belong  to  statistics 
in  the  wide  sense.  We  say  in  the  "wide  sense"  because  in  many  textbooks 
on  statistics  the  definition  of  this  science  includes  the  possibility  of 
gaining  information  by  experimentation  to  an  extent  depending  on  the 
statistician  (see  e.g.  the  Introduction  of  [14]).  It  is,  however,  not  always 
the  case  concerning  the  stochastic  systems  stochastic  programming  deals 
with.  If  e.g.  we  formulate  reservoir  system  design  models  using  past 
hydrological  data  then  it  is  not  possible  to  perform  further  experimentation 
unless  we  postpone  the  date  of  the  building  of  the  reservoirs.  Some 
statisticians  do  not  require  that  the  possibility  of  getting  information 
by  further  experimentation  should  be  characteristic  for  the  statistical 
methods.  Accepting  this,  it  is  in  fact  appropriate  to  say  that  stochastic 
programming  is  an  "extension  of  statistics". 

For  about  twenty  years,  however,  stochastic  programming  has  had  a 
development  almost  independent  of  the  parallel  development  of  the  statistical 
methods.  Hence  it  is  reasonable  to  show  that  some  of  the  stochastic 
programming  models  and  methods  provide  useful  tools  for  the  solution  of 
known  problems  in  classical  statistics  and  probability.  The  purpose  of 
this  paper  is  to  show  this  possibility  by  a number  of  examples. 

The  possibility  of  applying  stochastic  programming  in  statistics  and 
probability  is  easy  to  understand.  In  fact  the  decision  principles  of 
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stochastic  progrcunming  contain  ideas  taken  from  classical  statistical  theories. 
Prescribing  lower  or  upper  bound  for  probabilities  and  find  optimal  strategy 
is  a general  way  of  thinking  in  testing  hypothesis.  To  decide,  then  observe 
and  to  decide  again,  appears  to  be  the  general  decision  scheme  in  the  deci- 
sion theory  of  Wald  and  others.  To  find  optimal  strategy  by  minimizing  an 
expectation  depending  on  some  decision  variable (s)  already  appears  in  the 
work  of  D.  Bernoulli  concerning  the  solution  of  the  Petersburg  problem  [16J . 
However,  the  fact  that  similar  principles  can  be  applied  and  how  they  can  be 
applied  for  large  scale  problems  described  by  mathematical  programming 
problems  where  some  of  the  parameters  are  random,  was  by  no  means  simple 
to  discover.  Because  not  only  the  formal  application  of  the  statistical 
principles  was  necessary  to  carry  out  but  the  new  problems  turning  out 
needed  to  be  numerically  solvable  mathematical  programming  problems  and  these 
must  have  resulted  in  good  applications. 

The  problems  we  formulate  in  this  paper  are  of  "chance  constrained"  or 
in  other  words  "probabilistic  constrained"  type.  This  type  of  stochastic 
programming  model  has  been  introduced  by  Charnes,  Cooper  and  Symonds  [3] 

(the  paper  was  presented  at  the  Econometric  Society  meeting  in  Washington, 

D.C.,  1953).  In  our  models,  however,  we  frequently  impose  constraint  on 
the  joint  probability  for  the  occurrence  of  random  events  which  is  important 
in  particular  in  applications  in  statistics  and  probability  from  the  point 
of  view  of  model  construction.  At  the  same  time  this  makes  the  numerical 
solution  of  the  problem  more  complicated. 

Since  extensive  application  has  been  done  by  the  author  and  his 
collaborators  concerning  such  stochastic  programming  problems,  evidence 
exists  that  problems  of  the  type  we  are  going  to  formulate  are  numerically 
solvable  at  least  in  moderate  sizes. 
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The  problems  we  are  going  to  formulate  are  of  the  following  type 
(1.1)  minimize  f (x)  subject  to 

h^lx)  = P(g^(x,C)  > 0, . . . ,g^ (x,C)  > 0)  > p , 

h^ (x)  > 0,  i = 1 , . . . ,m  , 

where  x e g^,...,g^  are  functions  of  the  deterministic  variable  x and 

the  random  variable  C further  h, ,...,h  are  functions  of  the  deterministic  I 

1 m 

variable  x.  Problems  of  the  type  (1.1)  are  solved  by  the  combined  use  of 

I 

I 

nonlinear  programming  and  simulation  where  the  latter  is  used  to  determine 

function  values  which  are  probabilities  and  gradient  values  which  can  be  i 

expressed  in  terms  of  probabilities  concerning  h^  and  these  probabilities 
belong  to  sets  in  higher  dimensional  spaces. 

If  Problem  1 is  a convex  programming  problem  i.e.  the  set  of  feasible  solu- 
tions is  convex  and  f (x)  is  a convex  function,  then  when  solving  the  problem  we 
can  reach  global  optimum,  otherwise  we  can  only  expect  local  optimum . To  have  a convex 
programming  problem  it  is  crucial  to  Itnow  that  the  constraint  ^ P 

determines  a convex  set.  General  theorems  ensure  this  property.  For  the 
reader's  convenience  we  formulate  here  two  theorems  proved  in  [9],  [10]  and 
[1] , [2],  respectively.  For  further  references  see  [13]. 

Theorem  1 . If  g^,...,g^  are  concave  functions  in  all  variables  in  the 
entire  space  and  C has  a logarithmically  concave^  probability  density 
function,  then  h^  is  a logarithmically  concave  function  in  the  entire  space 
of  the  variable  x. 


nonnegative  point  function  h defined  on  a convex  set  A is  said  to  be 
logarithmically  concave  (logconcave)  if  for  every  x,  y e A and  0 < X < 1, 

we  have  h(Ax  + (l-X)y)  > [h  (x) ) ^ [h (y ) ] ^ 
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Theorem  2.  If  x e r'’,  C « and  are  concave  functions  in 

further  the  probability  density  function  f of  the  random  vector  C 
has  the  property  that  f is  convex  in  r'^,  then  h^  is  a quasi-concave 

function  in  R*^. 

In  some  of  the  problems  formulated  in  this  paper  we  shall  allow  more 
than  one  probabilistic  constraints. 


A function  h 
for  every  x,y 


defined  on  a convex  set  A is  said  to  be  quasi-concave 
e A and  0 < A < 1,  we  have  f (Ax  + (l-A)y)  > min(f(x) 


if 

f (y)l  . 
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2.  Testing  Statistical  Hypothesis  and  Construction 


of  Confidence  Region 

The  simplest  example  that  can  be  formulated  to  illustrate  testing 
hypothesis  is  to  test  the  probability  distribution  against  the  alterna- 

tive . For  the  sake  of  simplicity  we  assume  that  these  are  discrete 
distributions  assigning  positive  probabilities  only  to  positive  integers 
and  P^(k) , P^(k)  respectively,  designate  the  probabilities  belonging  to 
the  integer  k.  To  construct  a test  of  size  less  than  or  equal  to  the 
prescribed  probability  a and  having  maximum  power  is  equivalent  to  finding 
a set  of  integers  S such  that 

first  kind  error  = T P„(k)  < a , 

0 = 
kes 

(2.1) 

power  of  the  test  = J P,  (k)  ->•  max  . 

kes 

This  problem  can  be  formulated  as  a mathematical  programming  problem 

maximize  [ P (k)x 

kes  ^ ^ 

(2.2)  subject  to  Y P„{k)x,  < a , 

^ 0 k = 

kes 

X,  = 0 or  1 all  k. 
k 

The  optimal  set  S is  then  given  by 

S = {klx,  = 1}  . 

' k 

Problem  (2.2)  is  known  in  the  operations  research  as  the  knapsack  problem  and 
has  a simple  solution  mentioned  already  in  [7,  p.  64].  We  have  mentioned  the 
above  problem  only  in  order  to  present  a starting  point  for  the  formulation 
of  more  sophisticated  tests.  We  introduce  the  following  notations 

H:  Set  of  probability  distributions  representing  a hypothesis. 

K:  Set  of  probability  distributions  representing  the  alternative. 
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Sli 


X:  Vector  valued  random  variable.  On  the  basis  of  the  observed 
value  of  X we  reject  or  accept  the  hypothesis, 
a:  Prescribed  upper  bound  for  the  first  kind  errors. 

8:  Variable  to  be  maximized,  its  optimum  value  is  the  maximum 
value  of  the  minimum  power  of  the  test. 

S:  Critical  region. 

The  problem  is  to  find  S by  solving  the  following  problem 

maximize  8 subject  to 
P (X  e s|f)  < a for  every  F e H , 

(2.3) 

P(X  e s|f)  > 8 for  every  F e K . 

In  (2.3)  on  the  left  hand  sides  there  stand  probabilities  of  the  event  X e S 
using  the  probability  distribution  F for  the  random  vector  x. 

Problem  (2.3)  is  too  general.  We  can  hardly  find  a set  S this  way. 

We  can,  however,  restrict  the  type  of  the  set  S and  find  one  in  the 
restricted  category.  To  this  end  let  us  introduce  the  sets 

= {x|Lj^(x)  > bj^},  k = l,...,m 

m 

S = \J  S , 
k=l 

where  L,  ,...,L,  are  given  linear  forms  of  the  variable  x e R*'  and 
Ik 

b, ,...,b  are  real  parameters  the  values  of  which  we  want  to  determine  by 
1 m 

an  optimization  problem.  The  closure  of  the  complementary  of  S is  a convex 

polyhedron  for  every  fixed  values  of  the  parameters  b, ,...,b  and  through 

1 m 

the  variation  of  the  parameters  only  (some  of)  the  faces  are  shifted  (see 
Pig.  1) . T.  .j  assume  that  all  probability  distributions  in  H and  K are 
continuous.  This  implies  that  each  of  them  assigns  0 probability  to  any 
hyperplane.  Knowing  this,  we  can  write 


-6- 


Figure  1 


P(X  6 S|P)  = 1 - P(Lj^(X)  < bj^,  k = l,...,m|F) 
for  every  F e H . 

This  is  the  probability  in  the  first  constraint  of  Problem  (2.3).  Th 

constraint  itself  can  be  written  in  the  following  manner: 

(2.4)  P(L,  (X)  < b,  , k = l,...,m|F)  > 1 - a for  F e H . 

k = k = 


If  every  F 6 H has  a logconcave  density  or  a density  satisfying  the 


condition  of  Theorem  2,  then  the  set  of  vectors  (b, ) satisfying 

1 m 

(4)  is  a convex  set.  This  is  ensured  by  Theorem  1 and  2,  whichever  applies. 

In  most  of  the  cases  we  cannot  expect  that  the  second  constraint  in 

Problem  (2.3)  in  this  joint  probabilistic  constraint  will  determine  a 

convex  set  of  the  (b, ,...,b  ) vectors.  Therefore  instead  of  the  single 
1 m 

second  constraint  we  shall  use  constraints  imposing  lower  bounds  for  the 
single  probabilities.  These  are  the  following 

(2.5)  P(L,  (X)  > b,  |f)  > B,  for  k = l,...,m  and  F e K , 

k = k = k 


where  B. , . . . , 6 will  also  be  handled  as  variables.  Putting  B,  = e 


m 


k = l,...,m  we  formulate  our  problem  in  the  following  manner 

minimize  (y,  + •••  + Y ) subject  to 
1 m 


(2.6) 


P(Lj^(X)  < bj^,  k = l,...,m|F)  > 1 - a for  F e H , 


e P(Lj^(X)  > 1 ^ “ l,...,m  and  F e K , 


> 0,  k = 1, . . . ,m  . 


If  every  probability  distribution  in  H U K satisfies  the  condition  of 
Theorem  1 or  Theorem  2,  then  (2.6)  is  a convex  programming  problem.  Minimizing 

e 

k=l 


instead  of  the  sum  of  , the  convex  programming  character  of  the 

1 m 

problem  will  not  be  disturbed. 

Similar  problem  can  be  formulated  for  the  construction  of  confidence 

region.  In  this  case  we  use  the  first  constraint  of  Problem  (2.6),  impose  some 

other  constraints  on  b-,...,b  (these  can  be  e.g.  lower  and  upper  bounds) 

1 m 

and  minimize  a certain  function  of  the  variables  b. ,...,b  . Thus  we  are 

1 m 

lead  to  the  following  problem: 
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minimize  f (b)  subject  to 

(2.7)  P(L,  (X)  < b . k = l,...,m|F)  > 1 - a for  F e H , 

k = k = 

(b)  > 0,  i = 1, . . . ,M  . 

If  every  F in  H satisfies  the  condition  of  Theorem  1 or  Theorem  2,  then 

the  first  constraint  determines  a convex  set  of  the  b vectors.  Then,  if  f 

is  convex  and  h, , . . . ,h  , are  concave  or  quas i -concave , Problem  (2.7)  will 
1 M 

be  a convex  programming  problem.  The  confidence  region  will  be  the  inter- 
section of  the  sets  S, , . . . ,S  using  the  optimal  b, , . . . ,b  produced  by 

1 m 1 m 

Problem  (2.7) . The  above  notion  of  a confidence  region  is  slightly  different 
from  the  conventional  one.  The  reason  is  that  we  work  with  general  sets  of 
probability  distributions  and  disregard  the  parameter.  It  is  not  difficult, 
however  to  specialize  our  problem  formulation  for  the  case  when  we  have 
parameter  and  to  derive  this  way  confidence  region  in  the  conventional  sense 
for  the  unknown  parameter . 
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3.  Construction  of  Tolerance  Regions 


Let  F be  the  probability  distribution  of  the  random  vector 

X = (X  , . . . ,X  ) . The  random  set  S is  a 6-content  tolerance  region  of 
IN 

confidence  level  y if  the  following  inequality  holds 

(3.1)  P(/  dF  > 6)  > Y • 

S “ “ 

The  random  set  S is  constructed  on  the  basis  of  a sample  taken  from  the 
population  F.  The  probability  standing  on  the  left  hand  side  in  (3.1)  is 
called  the  coverage  of  S. 

Instead  of  a general  formulation  of  the  construction  of  a tolerance 
region  based  on  stochastic  programming,  we  show  the  principal  idea  on  a simple 
problem . 

Suppose  we  want  to  construct  tolerance  region  of  the  form  S = (0,Kx) 
for  an  exponential  distribution  with  unknown  parameter  X where  x is  the 
empirical  mean  of  a sample  of  size  n taken  from  this  population  and  K 
is  a number  to  be  determined  so  that  it  should  be  the  smallest  number 
satisfying  (3.1)  with  given  6 and  y.  In  our  case  (3.1)  can  be  written 
in  the  following  manner 

(3.2)  P(1  - e > 6)  > Y • 

Let  us  introduce  the  notation  y = Xx.  The  probability  distribution  of  y 
is  independent  of  X and  ny  has  a standard  gamma  distribution  with  parcimeter 
n i.e.  ny  has  the  following  probability  density  function: 

=77^ifr  if  ^ ^ 0 

and  f(z)  = 0 otherwise.  The  inequality  (3.2)  can  be  rewritten  as  follows: 

(3.3)  P(ny  > ^ log  ^ ) > y . 
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1_ 

L 


It  will  be  more  convenient  to  use  the  new  variable  L = Puttinq  K = 

K 

in  (3.3),  our  problem  will  be  to  find  the  largest  L satisfying  (3.3). 

Assume  now  that  we  have  m independent  exponentially  distributed  random 

variables  with  parameters  A ,...,A  , respectively.  Let  x, , . . . ,y  denote 

1 m 1 m 

the  sampling  expectations  corresponding  to  independent  samples  of  sizes 

n-,...,n  ta)cen  from  these  populations  and  y.  =Ax.,  i = l,...,m.  For 
1 m 11 

the  construction  of  a tolerance  region  of  the  form  (x  means  Cartasian 
product) : 


(3.4) 


(0,  X ) X (0,  X ) X X (0,  X ) 

L-  X L ^ L in 

12  m 


it  is  reasonable  to  choose  the  following  decision  principle 


maximize  (L^  + 


+ L ) subject  to 
m 


(3.5)  ] f P(n.y.  > L.n.  log  -)  > y » 

11=11  1-6  = 

1=1 

a.  <L.  <b.,  i=l,...,m, 

1 = 1 = 1 

where  a^,...,a^;  b^,...,b^  are  prescribed  bounds.  Since  a standard  gamma 

density  function  with  parcimeter  greater  than  or  equal  to  1,  is  a logconcave 

point  function,  it  follows  from  Theorem  1 that  the  constraining  function  i.e. 

the  function  standing  on  the  left  hand  side  of  the  first  constraint  of 

Problon  (3.5),  is  a logconcave  function  of  the  variables  L, ,...,L  . Thus 

1 m 

(3.5)  is  a convex  programming  problem. 

It  is  not  necessary  to  restrict  ourselves  to  the  case  of  independent 
random  variables.  Consider  e.g.  the  random  variables 


^1=1^  <==1  + z^)  , 


^2  = ^ (^1  + Z3)  , 
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where  X ,X  are  positive  (unknown)  constants  and  z ,z  ,z  are  standard 

± A X ^ j 

gamma  distributed  random  variables  with  known  parameters 

we  take  three  independent  samples  concerning  them  of  size 

n and  use  the  notations  ^i'^2'^3  sample  means,  further 

= Xz^,  = Xz^,  = Xz^,  then  we  can  formulate  the  following  stochastic 
programming  problem 

maximize  (L^  + L^)  subject  to 


(3.6) 


n(y,  + y.)  > nL  log 


rKy^  + y3)  > nL^  log 


> y 


a.  <L.  <b.,  i=l,2  . 

1 = 1 = 1 

The  random  variables  ny^^/ny^/ny^  are  independent  and  they  have  standard 
gamma  distributions  with  parameters  nd^jnd^jnd^.  If  nd^  > 1,  nd^  > 1, 
nd^  > 1,  then  ny^^jny^/ny^  have  a logconcave  joint  density.  Hence  by 
Theorem  1,  the  constraining  function  in  the  first  constraint  of  Problem  (3.6) 
is  a logconcave  function  of  the  variables  Thus  (3.6)  is  a convex 

programming  problem.  The  tolerance  region  will  be  that  special  case  of 


(3.4)  where  m = 2 and  ^j^'^2  components  of  an  optimal  solution 

of  Problem  (3.6). 
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4.  Optimum  Allocation  in  Surveys 


A well-known  application  of  nonlinear  programming  is  to  find  the  number 
of  elements  in  the  different  strata  concerning  stratified  sampling  from  a 
finite  population.  First  we  formulate  this  problem  that  is  a deterministic 
one,  then  using  it  as  an  underlying  deterministic  problem,  we  formulate  a 
stochastic  programming  problem.  Let  us  introduce  the  following  notations: 

L Number  of  strata 

N,  Elements  in  stratum  h 

h 

L 

N = )]  N Total  number  of  elements  in  the  population 

h=l  ^ 


to  have  linear  constraints,  we  prefer  to  use  the  variables  X . 

X L 


Since 


n = 1/(X  + — ) , 

h h N, 

h 


our  problem  reads  as  follows 


minimize  T C,/(X,  + — ) subject  to 
, . h h N, 

h=l  h 


h=l 


hj  h 


0 < X^  < 1 
= h = 


N, 


j = 1, . . . ,r  , 


h ~~  1, ■ 


In  this  problem  the  constraints  are  linear  and  the  objective  function  to  be 
minimized  is  convex.  Hence  (4.1)  is  a convex  programming  problem. 

Assume  now  that  within  the  strata  we  have  such  populations  the  variances 
of  which  are  random  variables.  Then  we  can  impose  a probabilistic  constraint 
on  the  first  p constraints  of  Problem  (4.1)  and  formulate  the  following 
new  problem 

V 1 

minimize  ) C,  (X,  + ; — ) subiect  to 

1-1  h h N, 
h=l  h 


(4.2) 


^ \j\  i ‘^j'  ^ ^ ^ P 


h=l 


0 ^ 1 h 1 L • 

= h = N, 

h 


We  may  take  a small  sample  before  and  use  the  aposteriori  distribution 
of  the  coefficients  a,^  given  the  result  of  the  small  sample.  In  this 
case  the  structure  of  the  problem  (4.2)  remains  but  we  have  new  probability 
distribution  for  the  random  variables  in  the  first  constraint. 

Problems  of  the  type  (4.2)  are  frequently  nonconvex.  Here  the 
coefficients  of  the  unknowns  are  random  in  the  probabilistic  constraint. 
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Some  result  concerning  programminq  under  probabilistic  constraint  with 
random  technology  matrix  are  presented  in  f5],  [111.  According  to  these 
results  still  in  many  cases  (4.2)  will  be  a convex  programming  problem. 


5.  An  Example  Concerning  Stochastic  Processes 


We  consider  the  Moran  model  for  the  dam  [8]  and  see  how  stochastic 

programming  can  improve  this  model. 

Time  will  be  subdivided  into  discrete  periods  and  we  number  them  by 

1,2,...  . Let  K be  the  capacity  of  the  dam.  Assume  that  in  the 

beginning  of  Period  i an  input  occurs:  out  of  a total  input  that 

amount  for  which  we  have  freeboard,  fills  up  the  reservoir  to  that  extent 

and  the  remaining  water  overflows.  After  this,  an  output  (release)  occurs. 

We  release  an  amount  equal  to  M if  at  least  that  amount  is  available  and 

we  release  the  total  amount  from  the  dam  if  the  available  amount  is  smaller 

than  M.  Let  z.  denote  the  water  content  of  the  reservoir  at  the  end  of 
1 

Period  i.  The  following  recursive  relation  holds  true 

(5.1)  = max(min(z^_^  + x^,K)  - M,0],  i = 1,2,... 

where  z^  is  the  initial  water  content  of  the  reservoir.  All  demands  will 
be  met  in  the  course  of  the  first  n periods  if  and  only  if  the  following 
relations  hold: 

(5.2)  z.  = min(z.  , + x.,K)  - M > 0,  i = l,...,n  . 

If  x^/x^,...  are  independent,  identically  distributed  random  variables, 
then  form  a Mar)cov  chain.  Under  mild  conditions  we  have  ergodicity 

(81  and  using  the  stationary  limit  distribution,  a reservoir  capacity  design 
principle  can  be  formulated  so  that  we  put 

(5.3)  P(min(z^  ^ + x^,K)  > M)  = p , 

where  p is  a prescribed  high  probability. 

We  now  drop  the  condition  imposed  on  the  random  variables  x^,X2,... 

and  assume  only  that  concerning  a finite  subsequence  x, ,...,x  , the 

1 n 
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condition  of  Theorem  1 or  Theorem  2 is  fulfilled.  We  also  drop  the  condition 


that  the  amount  of  water  to  be  released  is  constant  and  does  not  depend  on 

the  period.  Moreover,  we  shall  introduce  the  water  quantities  M - . . . ,M 

1 n 

to  be  released  in  the  subsequent  periods  as  unknowns.  In  order  to  obtain 
the  recursive  relations  for  this  case  we  only  have  to  write  instead 

of  M in  (5.1).  Wri'-ing  M^  instead  of  M in  (5.2)  we  obtain  a necessary 
and  sufficient  condition  that  all  demands  can  be  met  in  the  course  of  the  n 
periods . 

For  the  determination  of  the  capacity  K and  M, we  formulate 

1 n 

the  following  stochastic  programming  problem: 

minimize [c (K)  - c,M,  - •••  - c M ] subject  to 
11  n n 

(5.4)  P(min(z^_^  + x^,K)  “ > 0,  i = l,...,n)  > p , 

0 < K < Kq 

0 < Mi  < Mip,  i = 1, . . . ,n  , 

where  p is  a prescribed  high  probability,  *^o'^iO'  ^ ~ l,...,n  are  given 

constants,  c(K)  is  the  building  cost  of  the  reservoir  and  c, ,...,c 

In 

are  the  benefits  of  water  units  in  the  subsequent  periods.  We  can  e.g. 

assume  that  winter  moisture  fills  up  completely  the  reservoir  thus  Periods 

l,...,n  are  some  few  months  in  the  spring,  summer  and  fall.  If  the  reservoir 

serves  for  50  years,  say,  then  the  building  cost  should  be  subdivided  into 

50  equal  parts  in  a discounted  form  and  use  only  the  first  year  part  as 

c(K)  to  have  a right  economic  formulation  of  the  problem.  This  is,  however, 

. not  a central  point  of  our  present  discussion. 

Problem  (5.4)  is  a convex  programming  problem  if  c(K)  is  a convex 

function  and  the  joint  distribution  of  x ,...,x  satisfies  the  condition 

1 n 

of  Theorem  1 or  Theorem  2. 

i 
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For  further  reservoir  system  design  models  based  on  stochastic 
programming  the  reader  is  referred  to  [10],  [12]. 
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